On Objective Measures of Rule Surprisingness
Most of the literature argues that surprisingness is an inherently subjective aspect of discovered knowledge that cannot be measured in objective terms. This paper departs from that view, with a twofold goal: (1) to show that it is indeed possible to define objective (rather than subjective) measures of discovered rule surprisingness; and (2) to propose new ideas and methods for defining such objective measures.
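The notion of an objective (data-driven) surprisingness measure can be illustrated with a minimal sketch. The measure below, the deviation of a rule's confidence from the class prior, is an illustrative stand-in computed purely from record counts, not one of the specific measures proposed in the paper:

```python
# Hedged sketch: one simple *objective* surprisingness score for a rule
# A -> C, computed from data counts alone (no user-supplied beliefs).

def rule_surprisingness(n_a_and_c, n_a, n_c, n_total):
    """Deviation of rule confidence P(C|A) from the class prior P(C)."""
    confidence = n_a_and_c / n_a   # P(C | A): how often the rule holds
    prior = n_c / n_total          # P(C): baseline rate of the class
    return confidence - prior      # > 0 means the rule beats the prior

# Example: rule fires on 50 records and is right on 40 of them,
# while the class covers 300 of 1000 records overall.
print(rule_surprisingness(40, 50, 300, 1000))  # 0.8 - 0.3 = 0.5
```

Because every term comes from the dataset itself, two analysts running the same code on the same data get the same score, which is what distinguishes an objective measure from a subjective one.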
Inducing safer oblique trees without costs
Decision tree induction has been widely studied and applied. In safety applications, such as determining whether a chemical process is safe or whether a person has a medical condition, the cost of misclassification in one of the classes is significantly higher than in the other. Several authors have tackled this problem by developing cost-sensitive decision tree learning algorithms, or by changing the distribution of training examples to bias the decision tree learning process so that it takes account of costs. A prerequisite for applying such algorithms is the availability of misclassification costs. Although this may be possible for some applications, obtaining reasonable estimates of misclassification costs is not easy in the area of safety.
This paper presents a new algorithm for applications where the cost of misclassification cannot be quantified, although the cost of misclassification in one class is known to be significantly higher than in the other. The algorithm utilizes linear discriminant analysis to identify oblique relationships between continuous attributes, and then carries out an appropriate modification to ensure that the resulting tree errs on the side of safety. The algorithm is evaluated against one of the best-known cost-sensitive algorithms (ICET), a well-known oblique decision tree algorithm (OC1), and an algorithm that utilizes robust linear programming.
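The idea of an oblique split that "errs on the side of safety" can be sketched in a few lines. The shift rule below is illustrative and not the paper's exact modification: given a linear projection direction (e.g. from linear discriminant analysis), it places the split threshold so that no training example of the high-cost ("unsafe") class falls on the safe side of the hyperplane.

```python
import numpy as np

# Hedged sketch: biasing an oblique (linear) split so that all residual
# training errors land on the safe side. `w` stands in for a direction
# obtained from LDA; the conservative threshold choice is an assumption
# for illustration, not the algorithm described in the paper.

def safe_oblique_split(X, y, w, unsafe_label=1):
    """Pick a threshold b such that every unsafe training point
    satisfies w . x >= b, i.e. is flagged as unsafe."""
    scores = X @ w                          # project onto the oblique axis
    unsafe_scores = scores[y == unsafe_label]
    return unsafe_scores.min()              # most conservative cut

X = np.array([[0.1, 0.2], [0.9, 0.8], [0.5, 0.4], [0.7, 0.9]])
y = np.array([0, 1, 0, 1])                  # 1 = unsafe (high-cost) class
w = np.array([1.0, 1.0])
b = safe_oblique_split(X, y, w)
print(b)  # threshold sits at the lowest-scoring unsafe example (1.6)
```

Note that this needs no numeric costs at all, only the ordering assumption that one class's errors are worse than the other's, which is exactly the setting the paper targets.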
Inter-comparison of the g-, f- and p-modes calculated using different oscillation codes for a given stellar model
In order to make asteroseismology a powerful tool to explore stellar interiors, different numerical codes should give the same oscillation frequencies for the same input physics. This work is devoted to testing, comparing and, if needed, optimizing the seismic codes used to calculate the eigenfrequencies that are ultimately compared with observations. The oscillation codes of nine research groups in the field have been used in this study. The same physics has been imposed for all the codes so that any remaining differences can be attributed to non-physical causes. Two equilibrium models with different grids, 2172 and 4042 mesh points, have been used; the latter model includes an explicit modelling of semiconvection just outside the convective core. Comparing the results for these two models illustrates the effect of the number of mesh points and their distribution in particularly critical parts of the model, such as the steep composition gradient outside the convective core. A comprehensive study of the frequency differences found for the different codes is given as well. These differences are mainly due to the use of different numerical integration schemes. A second-order integration scheme plus a Richardson extrapolation provides results similar to those of a fourth-order integration scheme. The proper numerical description of the Brunt-Väisälä frequency in the equilibrium model is also critical for some modes. An unexpected result of this study is the high sensitivity of the frequency differences to the inconsistent use of values of the gravitational constant (G) in the oscillation codes, within the range of the experimentally determined ones, which differ from the value used to compute the equilibrium model.
Comment: 18 pages, 34 figures
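The claim that a second-order scheme plus Richardson extrapolation matches a fourth-order scheme rests on a standard error-cancellation argument: if the error scales as h^2, combining results from grids with spacings h and h/2 as (4*f_fine - f_coarse)/3 cancels the leading error term. The eigenfrequency solvers themselves are far more involved; the trapezoidal rule below is only a minimal second-order stand-in used to demonstrate the combination:

```python
import math

# Hedged sketch: Richardson extrapolation of a second-order method.
# The trapezoidal rule (error ~ h^2) plays the role of the second-order
# integration scheme mentioned in the abstract.

def trapezoid(f, a, b, n):
    """Composite trapezoidal rule on n subintervals (error ~ h^2)."""
    h = (b - a) / n
    s = 0.5 * (f(a) + f(b)) + sum(f(a + i * h) for i in range(1, n))
    return h * s

def richardson(f, a, b, n):
    """Combine coarse (n) and fine (2n) grids; the h^2 error cancels,
    leaving fourth-order accuracy."""
    coarse = trapezoid(f, a, b, n)
    fine = trapezoid(f, a, b, 2 * n)
    return (4 * fine - coarse) / 3

exact = 2.0  # integral of sin(x) on [0, pi]
plain = trapezoid(math.sin, 0, math.pi, 8)
extrap = richardson(math.sin, 0, math.pi, 8)
print(abs(plain - exact), abs(extrap - exact))  # extrapolated error is much smaller
```

The same (4*fine - coarse)/3 combination applied to eigenfrequencies computed on coarse and fine stellar-model meshes is what lets a second-order oscillation code reach near fourth-order accuracy.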
Perspectives in Global Helioseismology, and the Road Ahead
We review the impact of global helioseismology on key questions concerning
the internal structure and dynamics of the Sun, and consider the exciting
challenges the field faces as it enters a fourth decade of science
exploitation. We do so with an eye on the past, looking at the perspectives
global helioseismology offered in its earlier phases, in particular the
mid-to-late 1970s and the 1980s. We look at how modern, higher-quality, longer
datasets coupled with new developments in analysis, have altered, refined, and
changed some of those perspectives, and opened others that were not previously
available for study. We finish by discussing outstanding challenges and
questions for the field.
Comment: Invited review; to appear in Solar Physics (24 pages, 6 figures)
Learning Compact Markov Logic Networks With Decision Trees
Statistical-relational learning combines logical syntax with probabilistic methods. Markov Logic Networks (MLNs) are a prominent model class that generalizes both first-order logic and undirected graphical models (Markov networks). The qualitative component of an MLN is a set of clauses, and the quantitative component is a set of clause weights. Generative MLNs model the joint distribution of relationships and attributes. A state-of-the-art structure learning method is the moralization approach: learn a set of directed Horn clauses, then convert them to conjunctions to obtain MLN clauses. The directed clauses are learned using Bayes net methods. The moralization approach takes advantage of the high-quality inference algorithms for MLNs and their ability to handle cyclic dependencies. A weakness of moralization is that it leads to an unnecessarily large number of clauses. In this paper we show that using decision trees to represent conditional probabilities in the Bayes net is an effective remedy that leads to much more compact MLN structures. In experiments on benchmark datasets, the decision trees reduce the number of clauses in the moralized MLN by a factor of 5-25, depending on the dataset. The accuracy of predictions is competitive with the models obtained by standard moralization, and in many cases superior.
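Why decision trees compact the moralized MLN can be sketched concretely: a full conditional probability table emits one clause per joint parent state (exponential in the number of parents), while a decision tree emits one clause per leaf, since each root-to-leaf path is already a conjunction. The tree encoding and predicate names below are illustrative assumptions, not the paper's exact representation:

```python
# Hedged sketch: turning decision-tree paths into weighted conjunctions.
# A tree node is (test, subtree_if_true, subtree_if_false); a leaf is a
# weight. Predicates like Friends(x,y) are hypothetical examples.

tree = ("Friends(x,y)",
        ("Smokes(y)", 1.2, -0.4),  # Friends & Smokes / Friends & !Smokes
        0.1)                        # !Friends: one leaf covers both Smokes states

def paths_to_clauses(node, conditions=()):
    """Emit one (conjunction, weight) pair per root-to-leaf path."""
    if not isinstance(node, tuple):  # leaf: the accumulated path is a clause
        return [(" ^ ".join(conditions) or "true", node)]
    test, yes, no = node
    return (paths_to_clauses(yes, conditions + (test,))
            + paths_to_clauses(no, conditions + ("!" + test,)))

for clause, w in paths_to_clauses(tree):
    print(w, clause)
# 3 clauses, versus 4 rows (2^2 parent states) in the full table
```

The saving grows with context-specific independence: whenever a subtree collapses several parent states into one leaf (as the `!Friends` branch does above), all of those table rows become a single clause, which is the source of the 5-25x reductions reported in the paper.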